MAISE: A Flexible, Configurable, Extensible Open Source Package for Mass AI System Evaluation
نویسنده
چکیده
The past few years have seen an increasing interest in using Amazon’s Mechanical Turk for purposes of collecting data and performing annotation tasks. One such task is the mass evaluation of system output in a variety of tasks. In this paper, we present MAISE, a package that allows researchers to evaluate the output of their AI system(s) using human judgments collected via Amazon’s Mechanical Turk, greatly streamlining the process. MAISE is open source, easy to run, and platform-independent. The core of MAISE’s codebase was used for the manual evaluation of WMT10, and the completed package is being used again in the current evaluation for WMT11. In this paper, we describe the main features, functionality, and usage of MAISE, which is now available for download and use.
منابع مشابه
Comparison of Open Source Learning Management Softwares and Presenting a Native Evaluation Tool
Introduction: Nowadays all educational institutes are trying to use technology in their structure. This effort has been faced with different barriers, including cost, time, and support. Therefore, using open source softwares can partially help us in using technology. In this article, we review main features of several open source learning management softwares, while presenting a tool which incl...
متن کامل3D Finite element modeling for Dynamic Behavior Evaluation of Marin Risers Due to VIV and Internal Flow
The complete 3D nonlinear dynamic problem of extensible, flexible risers conveying fluid is considered. For describing the dynamics of the system, the Newtonian derivation procedure is followed. The velocity field inside the pipe formulated using hydrostatic and Bernoulli equations. The hydrodynamic effects of external fluids are taken into consideration through the nonlinear drag forces in var...
متن کاملSushi.R: flexible, quantitative and integrative genomic visualizations for publication-quality multi-panel figures
MOTIVATION Interpretation and communication of genomic data require flexible and quantitative tools to analyze and visualize diverse data types, and yet, a comprehensive tool to display all common genomic data types in publication quality figures does not exist to date. To address this shortcoming, we present Sushi.R, an R/Bioconductor package that allows flexible integration of genomic visuali...
متن کاملUsing Computer Games Techniques for Improving Graph Visualization Efficiency
Creating an efficient, interactive and flexible unified graph visualization system is a difficult problem. We present a hardware accelerated OpenGL graph drawing engine, in conjunction with a flexible preview package. While the interactive OpenGL visualization focuses on performance, the preview focuses on aesthetics and simple network map creation. The system is implemented as Gephi, a modular...
متن کاملAn Open-Source Package for Recognizing Textual Entailment
This paper presents a general-purpose open source package for recognizing Textual Entailment. The system implements a collection of algorithms, providing a configurable framework to quickly set up a working environment to experiment with the RTE task. Fast prototyping of new solutions is also allowed by the possibility to extend its modular architecture. We present the tool as a useful resource...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011